SEQUENCE LISTING

C--> 3 (1) GENERAL INFORMATION: 

5 (i) APPLICANT: Bandman, Olga 

6 Corley, Neil C.

7 Guegler, Karl J. 

8 Lal, Preeti

C--> 10 (ii) TITLE OF INVENTION: PROSTATE-ASSOCIATED ETS PROTEIN

12 (iii) NUMBER OF SEQUENCES: 4 

14 (iv) CORRESPONDENCE ADDRESS: 

15 (A) ADDRESSEE: Incyte Pharmaceuticals, Inc.

16 (B) STREET: 3174 Porter Dr. 

17 (C) CITY: Palo Alto 

18 (D) STATE: CA 

19 (E) COUNTRY: USA 

20 (F) ZIP: 94304 

22 (v) COMPUTER READABLE FORM:

23 (A) MEDIUM TYPE: Diskette

24 (B) COMPUTER: IBM Compatible

25 (C) OPERATING SYSTEM: DOS

26 (D) SOFTWARE: FastSEQ for Windows Version 2.0

28 (vi) CURRENT APPLICATION DATA:

C--> 29 (A) APPLICATION NUMBER: US/09/866,356

C--> 30 (B) FILING DATE: 29-May-2001 

32 (vii) PRIOR APPLICATION DATA: 

33 (A) APPLICATION NUMBER: 09/055,113 

34 (B) FILING DATE: 1998-04-03 

38 (viii) ATTORNEY/AGENT INFORMATION: 

39 (A) NAME: Billings, Lucy J. 

40 (B) REGISTRATION NUMBER: 36,749 

41 (C) REFERENCE/DOCKET NUMBER: PF-0501 US 
43 (ix) TELECOMMUNICATION INFORMATION:

44 (A) TELEPHONE: 650-855-0555 

45 (B) TELEFAX: 650-845-4166 

46 (C) TELEX: 
49 (2) INFORMATION FOR SEQ ID NO: 1:

51 (i) SEQUENCE CHARACTERISTICS: 

52 (A) LENGTH: 335 amino acids 

53 (B) TYPE: amino acid 

54 (C) STRANDEDNESS: single

55 (D) TOPOLOGY: linear 

57 (vii) IMMEDIATE SOURCE: 

58 (A) LIBRARY: PROSTUT12 

59 (B) CLONE: 1813005 

62 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

64 Met Gly Ser Ala Ser Pro Gly Leu Ser Ser Val Ser Pro Ser His Leu 

65 1 5 10 15 

66 Leu Leu Pro Pro Asp Thr Val Ser Arg Thr Gly Leu Glu Lys Ala Ala

100 Arg Leu Trp Gly Ile Arg Lys Asn Arg Pro Ala Met Asn Tyr Asp Lys

101 290 295 300

102 Leu Ser Arg Ser Ile Arg Gln Tyr Tyr Lys Lys Gly Ile Ile Arg Lys

103 305 310 315 320

104 Pro Asp Ile Ser Gln Arg Leu Val Tyr Gln Phe Val His Pro Ile

105 325 330 335

107 (2) INFORMATION FOR SEQ ID NO: 2:

109 (i) SEQUENCE CHARACTERISTICS: 

110 (A) LENGTH: 1905 base pairs 

111 (B) TYPE: nucleic acid 

112 (C) STRANDEDNESS: single 

113 (D) TOPOLOGY: linear 

115 (vii) IMMEDIATE SOURCE: 

116 (A) LIBRARY: PROSTUT12 

117 (B) CLONE: 1813005 

119 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

121 CTCGAGCCGC GGCTGTCTGA CTTCCTCCCA GCACATTCCT GCACTCTGCC GTGTCCACAC 60

122 TGCCCCACAG ACCCAGTCCT CCAAGCCTGC TGCCAGCTCC CTGCAAGCCC CTCAGGTTGG 120

123 GCCTTGCCAC GGTGCCAGCA GGCAGCCCTG GGCTGGGGGT AGGGGACTCC CTACAGGCAC 180

124 GCAGCCCTGA GACCTCAGAG GGCCACCCCT TGAGGGTGGC CAGGCCCCCA GTGGCCAACC 240

125 TGAGTGCTGC CTCTGCCACC AGCCCTGCTG GCCCCTGGTT CCGCTGGCCC CCCAGATGCC 300

126 TGGCTGAGAC ACGCCAGTGG CCTCAGCTGC CCACACCTCT TCCCGGCCCC TGAAGTTGGC 360

127 ACTGCAGCAG ACAGCTCCCT GGGCACCAGG CAGCTAACAG ACACAGCCGC CAGCCCAAAC 420

128 AGCAGCGGCA TGGGCAGCGC CAGCCCGGGT CTGAGCAGCG TATCCCCCAG CCACCTCCTG 480

129 CTGCCCCCCG ACACGGTGTC GCGGACAGGC TTGGAGAAGG CGGCAGCGGG GGCAGTGGGT 540

130 CTCGAGAGAC GGGACTGGAG TCCCAGTCCA CCCGCCACGC CCGAGCAGGG CCTGTCCGCC 600

131 TTCTACCTCT CCTACTTTGA CATGCTGTAC CCTGAGGACA GCAGCTGGGC AGCCAAGGCC 660

132 CCTGGGGCCA GCAGTCGGGA GGAGCCACCT GAGGAGCCTG AGCAGTGCCC GGTCATTGAC 720

133 AGCCAAGCCC CAGCGGGCAG CCTGGACTTG GTGCCCGGCG GGCTGACCTT GGAGGAGCAC 780

134 TCGCTGGAGC AGGTGCAGTC CATGGTGGTG GGCGAAGTGC TCAAGGACAT CGAGACGGCC 840

135 TGCAAGCTGC TCAACATCAC CGCAGATCCC ATGGACTGGA GCCCCAGCAA TGTGCAGAAG 900

136 TGGCTCCTGT GGACAGAGCA CCAATACCGG CTGCCCCCCA TGGGCAAGGC CTTCCAGGAG 960

137 CTGGCGGGCA AGGAGCTGTG CGCCATGTCG GAGGAGCAGT TCCGCCAGCG CTCGCCCCTG 1020

138 GGTGGGGATG TGCTGCACGC CCACCTGGAC ATCTGGAAGT CAGCGGCCTG GATGAAAGAG 1080

139 CGGACTTCAC CTGGGGCGAT TCACTACTGT GCCTCGACCA GTGAGGAGAG CTGGACCGAC 1140

140 AGCGAGGTGG ACTCATCATG CTCCGGGCAG CCCATCCACC TGTGGCAGTT CCTCAAGGAG 1200

141 TTGCTACTCA AGCCCCACAG CTATGGCCGC TTCATTAGGT GGCTCAACAA GGAGAAGGGC 1260

142 ATCTTCAAAA TTGAGGACTC AGCCCAGGTG GCCCGGCTGT GGGGCATCCG CAAGAACCGT 1320

143 CCCGCCATGA ACTACGACAA GCTGAGCCGC TCCATCCGCC AGTATTACAA GAAGGGCATC 1380

144 ATCCGGAAGC CAGACATCTC CCAGCGCCTC GTCTACCAGT TCGTGCACCC CATCTGAGTG 1440

145 CCTGGCCCAG GGCCTGAAAC CCGCCCTCAG GGGCCTCTCT CCTGCCTGCC CTGCCTCAGC 1500

146 CAGGCCCTGA GATGGGGGAA AACGGGCAGT CTGCTCTGCT GCTCTGACCT TCCAGAGCCC 1560

147 AAGGTCAGGG AGGGGCAACC AACTGCCCCA GGGGGATATG GGTCCTCTGG GGCCTTCGGG 1620

148 ACCCTGGGGC AGGGGTGCTT CCTCCTCAGG CCCAGCTGCT CCCCTGGAGG ACAGAGGGAG 1680

149 ACAGGGCTGC TCCCCAACAC CTGCCTCTGA CCCCAGCATT TCCAGAGCAG AGCCTACAGA 1740

150 AGGGCAGTGA CTCGACAAAG GCCACAGGCA GTCCAGGCCT CTCTCTGCTC CATCCCCCTG 1800

151 CCTCCCATTC TGCACCACAC CTGGCATGGT GCAGGGAGAC ATCTGCACCC CTGAGTTGGG 1860

152 CAGCCAGGAG TGCCCCCGGG AATGGATAAT AAAGATACTA GGCGC 1905

154 (2) INFORMATION FOR SEQ ID NO: 3:

156 (i) SEQUENCE CHARACTERISTICS: 

157 (A) LENGTH: 114 amino acids 

158 (B) TYPE: amino acid 

159 (C) STRANDEDNESS: single 
160 (D) TOPOLOGY: linear

162 (vii) IMMEDIATE SOURCE: 

163 (A) LIBRARY: GenBank 

164 (B) CLONE: 157196 

166 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

168 Thr Asn Ala Ser Asn Gly Gly Thr Ala Thr Val Lys Arg Pro Asn Gly 

169 1 5 10 15

170 Gly Arg Thr Gly Gly Gly Gly Ser His Ile His Leu Trp Gln Phe Leu

171 20 25 30

172 Lys Glu Leu Leu Ala Ser Pro Gln Val Asn Gly Thr Ala Ile Arg Trp

173 35 40 45

174 Ile Asp Arg Ser Lys Gly Ile Phe Lys Ile Glu Asp Ser Val Arg Val

230 245 250 255

231 Ser Pro Arg Gln Glu Ser Thr Arg Val Ile Gln Leu Met Pro Ser Pro

232 260 265 270

233 Ile Met His Pro Leu Ile Leu Asn Pro Arg His Ser Val Asp Phe Lys

234 275 280 285

235 Gln Ser Arg Leu Ser Glu Asp Gly Leu His Arg Glu Gly Lys Pro Ile

236 290 295 300

237 Asn Leu Ser His Arg Glu Asp Leu Ala Tyr Met Asn His Ile Met Val

238 305 310 315 320

239 Ser Val Ser Pro Pro Glu Glu His Ala Met Pro Ile Gly Arg Ile Ala

240 325 330 335

241 Asp Cys Arg Leu Leu Trp Asp Tyr Val Tyr Gln Leu Leu Ser Asp Ser

242 340 345 350

243 Arg Tyr Glu Asn Phe Ile Arg Trp Glu Asp Lys Glu Ser Lys Ile Phe

244 355 360 365

245 Arg Ile Val Asp Pro Asn Gly Leu Ala Arg Leu Trp Gly Asn His Lys

246 370 375 380

247 Asn Arg Thr Asn Met Thr Tyr Glu Lys Met Ser Arg Ala Leu Arg His

248 385 390 395 400

249 Tyr Tyr Lys Leu Asn Ile Ile Arg Lys Glu Pro Gly Gln Arg Leu Leu

250 405 410 415

251 Phe Arg Phe Met Lys Thr Pro Asp Glu Ile Met Ser Gly Arg Thr Asp

252 420 425 430

253 Arg Leu Glu His Leu Glu Ser Gln Glu Leu Asp Glu Gln Ile Tyr Gln

254 435 440 445 

255 Glu Asp Glu Cys 

256 450 